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Tumour-associated antigen (B345) 

The present application claims the benefit, under 35 
U.S.C. § 119;. of the earlier filing dates of German 
Patent Application No. DE 100 33 080.0, filed July 7, 
2000; German Patent Application No. DE 101 19 294.0, 
filed April 19, 2001; U.S. Provisional Application No. 
50/243,158, filed October 25, 2000; and U.S. 
Provisional Application No. 60/297,747, filed June 14, 
2001. The contents of each of these applications are 
entirely incorporated herein by reference. 

The invention relates to the chemotherapy of tumour 
diseases . 



Normal body cells are subject to a strictly regulated 
system which controls growth, cell division and the 
dying off of certain cells. Thus, an adult's body cells 
divide only when they are replacing dead cells or 
healing an injury. Cancer cells, on the other hand, 
carry on growing out of control, they accumulate and 
form a tumour. When a tumour reaches a critical size, 
cancer cells may be transported through the bloodstream 
or lymphatic system into other parts of the body and 
form colonies there (metastases) . Not all tumours are 
cancerous, as benign tumours do not metastasise and are 
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therefore not usually life-threatening as they can be 
surgically removed- More detailed information on this 
subject and the other aspects of tumour formation 
discussed hereinafter can be found in the following 
publications: Rauscher and Vogt, 1997; Kastan, 1997; 
Hesketh, 1995; Pusztai, Lewis and Yap, 1995. 

The transformation of a healthy cell into a cancer cell 
may be triggered by a whole range of factors, such as 
environmental influences, radiation, viruses or 
chemical reagents. However, epigenetic modifications 
(methylations, acetylations and altered chromatin 
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structure) and genetic modifications (point mutation, 
deletion, amplification, translocation) also play a 
crucial role in tumour development. 

Mutations in coding regions of genes which are involved 
5 in regulating cell proliferation may contribute to the 
conversion of a normal cell into a tumour cell, as the 
transformed cell has growth advantages over its healthy 
neighbouring cell , 

Cancer is therefore produced by an accumulation of 
10 inherited or acquired mutations in critical 
protooncogenes or tumour suppressor genes. 

Cell proliferation is under the control of various gene 
systems, whereas products of oncogenes are involved in 
signal transmission from the cell surface to the cell 
15 nucleus, cyclin-dependent protein kinase and the 

inhibitors thereof accompany the cell through the cell 
cycle. Not infrequently, disorders in the synthesis of 
these proteins are found in tumour cells. The p53 
protein plays a crucial role. 

20 Proteins of the RB protein type regulate the 

availability of critical transcription factors. 

The genes which are highly regulated in tumour tissues 
are usually the starting point for more detailed 
analyses and as proteins of all kinds of functions are 
25 highly expressed, the approach for therapeutic 

interventions may take many forms. The objective of 
cancer research is therefore to find new target 
molecules (so-called targets) for therapeutic 
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interventions, which can then be used for a targeted 
therapy with few side effects. 

The primary objective is therefore to detect molecular 
changes between normal tissue and tumour at the level 
5 of gene expression ("transcription level"), which 

should identify new targets, on the one hand, and may 
be used to develop or find substances for inhibiting 
malfunctions, on the other hand. 

A whole range of different methods of identifying and 
10 characterising new targets which form the starting 

point for developing new therapeutic agents are based 
on drawing up differential mRNA transcription profiles 
between tumours and normal tissues. These include 
differential hybridisation, establishing subtractions 
15 cDNA banks (''representational difference analysis"; 

Hubank and Schatz, 1994; Diatchenko et al . , 1996) and 
the use of DNA chip technology or the SAGE method 
(Velculescu et al., 1995). 

As well as immunotherapeutic approaches, targeted 
2 0 chemotherapy plays an essential role in the treatment 
of cancer. By chemotherapy is meant the administration 
of substances which have either a cytostatic or 
cytotoxic/cytolytic effect as a result of interfering 
with the metabolism, signal transduction and cell • 
25 division processes of malignant cells. Chemotherapeutic 
agents can be divided into various categories on the 
basis of influencing specific targets in the tumour 
cell, the type of cellular interaction and interaction 
with a specific phase of the cell cycle. 
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The type of cancer treatment depends on the stage of 
the tumour^, the critical point being whether metastases 
are already present and how far they have spread 
through the body. The administration of cell toxins for 
5 cancer treatment, as well as surgical intervention and 
radiotherapy, is an integral part of present-day 
therapeutic concepts in oncology. 

Chemotherapy has essentially two main aims: the primary 
one is to cure cancer; this means that the tumour 
10 disappears and does not return. If a cure is no longer 
possible for various reasons, attempts are made to 
restrict or control the growth and spread of the 
tumour . 

In principle, substances used in chemotherapy are 
effective in all dividing cells. Tumour cells, however, 
are more sensitive to chemotherapeutic agents than, 
healthy cells, as it is mainly strongly proliferating 
cells which are attacked. 

Every tissue has its own growth characteristics, 
including cell division, stoppage of growth, 
differentiation and ageing, which are affected and 
regulated by internal and external factors. 

Many of the cytotoxic chemotherapeutic agents currently 
used are effective only on proliferating cells (not in 
25 the GO phase of cell division) . Both normal and cancer 
cells are attacked. The destruction of normal cells may 
lead to severe side effects; e.g. destruction of the 
blood cell-producing tissues of the bone marrow 
(myelosuppression) . 
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Chemotherapeutic agents are divided into various 
categories depending on how they affect specific 
substances within the tumour cell;, the cellular 
processes with which the drugs interact and the phase 
5 of the cell cycle which they influence. This 

information is necessary to oncologists in order to 
decide which preparations can be combined with one 
another in the therapy. 

The highly regulated genes in tumour tissues are thus 
10 potential new target structures and^ as proteins of all 
kinds of functions are highly expressed^ this is a very 
versatile approach for therapeutic interventions. 

The goal of cancer research is therefore to find new 
targets for therapeutic interventions which can then be 
15 used for targeted therapy with fewer side effects^ 

compared with the therapeutic agents currently used. 

In tumour tissues highly regulated genes are points of 
attack and hence potential target structures for 
chemotherapy . 

20 The problem of the present invention was to provide a 
new protein^ preferably expressed by tumour cells, 
which is a target molecule for intervention using 
chemotherapeutic methods. 

This problem was solved by first producing a cDNA 
25 subtraction library, using RDA ("representational 

difference analysis") between a lung-adenocarcinoma 
cell line (A549) and normal lung tissue. In order to 
select the antigens overexpressed in the tumour the 
cDNA clones obtained were then sequenced and compared 
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with sequences available in data banks. Among the 
annotated genes^. there were 321 unknown ones, for which 
there were in most cases ESTs ("expressed sequence 
tags") -entries in the data bank. After further 
5 qualitative PGR analysis in cDNA libraries of critical 
normal tissues and immunoprivileged tissues and more 
detailed data bank searches the number of candidate 
clones was restricted to 59, the ESTs of which do not 
come from critical normal tissues. 

10 These clones were spotted onto Incyte DNA chips and 
hybridised with a whole range of tumour tissues and 
normal tissues as a reference. The mRNA expression 
profiles of EST fragments which are differentially* 
expressed in cancer tissues and normal tissues and 

15 belong to an as yet unknown gene were verified using 
various methods . 

The length of the transcripts was determined by 
Northern blot analysis and the expression pattern in 
different cell systems was exactly characterised by 

20 quantitative PGR. Only unknown genes or ESTs with 

tumour-specific expression profiles were followed up 
and subjected to "full length cloning". Potential ORFs 
("open reading frames") are converted into the 
corresponding amino acid sequence and analysed for any 

25 possible prediction of function by in silico 
strategies. 

Human B345-cDNA was cloned, and the sequence obtained 
in a first attempt at cloning is shown in SEQ ID N0:1. 
Sequence analysis of the human B345-cDNA cloned on this 
30 occasion showed a continuous open reading frame from 
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position 215 to position 2461 (excluding stop codon) 
which^ at the nucleotide and protein level, shows no 
homology or identity to the known sequences of the data 
banks. The data obtained from the Northern Blot 
5 experiments lead one to conclude that the B345 

transcript is about 6.5 kb long. In a first attempt, a 
B345-CDNA with 5897 bp (excluding the polyA region) was 
obtained as the cloned region, while the presence of a 
polyadenylation signal and the PolyA tail at the 3 '-end 

10 of the sequence indicated that the cDNA is complete in 
this region. Because no continuous reading frame 
appeared in the 5 ' region of the cloned cDNA from 
position 1 to 214, it was initially assumed that the 
ATG at position 215, which also corresponds 75% to a 

15 Kozak translation initiation site (ACCATGT) (Kozak, 
1987), is the start codon of B345. 

In another cloning attempt, additional information 
about the sequence of B345 located further upstrearia was 
obtained by a standard method of molecular biology, 
20 specifically by so-called "Promoter Finder DNA 
Walking" . 

Thus, the B345 sequence (SEQ ID N0:1) obtained in the 
first cloning attempt was expanded in the 5' region. 
The start of transcription was located precisely using 

25 primer extension analysis and is found at position 201 
(SEQ ID NO: 3). By repeated sequencing in the 3' region 
too, an additional base was found at position 2430, 
compared with the sequence shown in SEQ ID N0:1, which 
leads to a shift in the reading frame and thus 

30 displaces the stop codon from position 2729 to 2791. 
The cDNA obtained (SEQ ID NO: 3) has an open reading 
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frame which codes for a potential protein with a length 
of 836 amino acids (SEQ ID NO: 4). The translation 
initiation site at position 283 corresponds roughly 70% 
to a Kozak consensus sequence. 

5 The promoter region 200bp upstream of the presumed 

transcription start site contains neither a TATA nor a 
CC7\AT box, but does contain a clear GC box, which is a 
binding site of the SPl protein. The fact that the GC 
content in the 5' region is more than 60% indicates a 
10 CpG Island (Bird, 1986) . 

The resulting primary amino acid sequence of B345 is 
shown in SEQ ID NO: 4. Analysis of the hydrophilicity 
plot of the amino acid sequence shows that the B345 
protein has two characteristic hydrophobic domains 
15 which represent a signal peptide and a helical 

transmembrane domain (Fig. 6) . This polarised structure 
indicates that B345 is an integral membrane protein. 

The extracellular domain leads one to conclude that 
there is definitely one and possibly three CUB domains. 

20 CUB domains occur in various proteins, generally ones 

which have been regulated during the development of the 
embryo. A recent publication (Gerstein et al . , 2000) 
demonstrates that proteins containing CUB domains are 
the most markedly def f erentially regulated proteins in 

25 C, elegans. Since genes which play a key role in 

embryonic development perform corresponding functions 
in cancer, e.g. in cell division, cell proliferation or 
signal transmission, it can be assumed that 
overexpression of B345 in cells causes changes in the 

30 properties of substrate adhesion or the extracellular 
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matrix. The B345 protein has 12 potential N- 
glycosylation sites which can be found in the presumed 
extracellular domain . 

In view of its amino acid sequence it can be assumed 
5 that the B345 protein constitutes a 15-Sheet secondary 
structure, as it is known that CUB domains fold into a 
^h sandwich. 

The intracellular domain (section 691-836) has no 
significant homologies. However, the whole C-terminus 
10 showed 82% identity over 124 amino acids with an EST 
(Acc No. AW063026) of human ovarian cancer cells. 

Starting from the functions of other proteins 
containing CUB domains it can be concluded that th^ 
B345 transmembrane protein plays a part in the 

15 communication, interaction and/or the signal 

transduction with extracellular components or ligands. 
Moreover, the data of the expression analysis give a 
strong indication that B345 is implicated in the 
metastatic process of cancer, particularly cancer of 

20 the large intestine. 

To clarify the physiological function and role of B345 
in metastasis, the following methods of investigation 
may be used: 

First, cell lines, preferably human cell lines, which 
25 do not endogenously express B345, were identif led, , e . g , 
using TaqMan PCR. The cells were transfected with a 
plasmid which contains the B345 sequence and B345 was 
expressed. Changes in the morphology and/or migration 
characteristics, found e.g. by soft agar assay 
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(Hamburger and Salmon, 1977) or migration assay (Liaw 
et al; 1995) of the B345 expressing cells compared with 
the untransf ected cells, indicate that B345 plays a 
part in the biological process responsible. This is a 
clear indication of the involvement of B345 in the 
interaction of tumour cells with one another and/or 
with the extracellular matrix and hence of a function 
in metastasis. 

Alternatively or in addition to this functional 
analysis, in a complementary approach, the expression 
of B345 in cells which express this protein 
endogenously was suppressed in order to extablish, 
again, any changes in the morphology or migration 
characteristics . 

In addition, an investigation is optionally carried out 
to discover whether there are protein components which 
interact with B345 inter- or extracellularly (e.g. 
using the Yeast Two Hybrid System (Fields and Song, 
1989) . 

Thus, in a first aspect, the invention relates to a 
tumour-specific polypeptide designated B345 with the 
amino acid sequence shown in SEQ ID NO: 4 or a 
polypeptide which is coded by a polynucleotide which 
hybridises under stringent conditions with a 
polynucleotide of the sequence shown in SEQ ID NO: 3 or 
a partial sequence thereof, as well as protein 
fragments or peptides derived therefrom. 

In another aspect the present invention relates to an 
isolated DNA molecule coding for the tumour-specific 
polypeptide designated B345. 
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Preferably, the DNA molecule according to the invention 
is a polynucleotide having the sequence shown in SEQ ID 
NO: 3 or a fragment thereof or a DNA molecule which 
hybridises under stringent conditions with a DNA 
5 molecule of the sequence shown in SEQ ID NO: 3 or a 
partial sequence thereof, or a fragment thereof. 

By "stringent conditions" is meant, for example: 
incubation overnight at 65°C - eS^'C with 6xSSC (IxSSC = 
150 mM NaCl, 15 mM Tri-sodium citrate), 5xDenhardt's 
10 solution, 0.2%SDS, 50 jag/ml salmon sperm DNA, followed 
by washing twice for 30 min with 2xSSC, 0.1% SDS at 
65°C, once for 30 min with 0.2xSSC, 0.1% SDS at 65*'C 
and optionally finally rinsing with O.lxSSC, 0.1%SDS at 
65°C, or equivalent conditions. 

15 The DNA molecules according to the invention code for 
(poly) peptides designated B345 with the amino acid 
sequence shown in SEQ ID NO: 4 or for protein fragments 
or peptides derived therefrom; thus, DNA molecules or 
fragments which contain deviations from the sequence 

20 shown in SEQ ID NO: 3 as a result of the degeneration 
of the genetic code are also included. 

In one embodiment, the invention relates to an isolated 
DNA molecule of the sequence shown in SEQ ID NO: 3 or a 
fragment thereof, or a DNA molecule which hybridises 
25 with a DNA molecule having the sequence shown in SEQ ID 
NO: 3 or with a partial sequence thereof, coding for the 
natural B345 polypeptide or for a fragment thereof. 

The B345 DNA molecules may be used in a so-called DNA 
vaccine for the immunotherapy of tumours. 
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The B345 DNA molecules according to the invention, 
preferably in recombinant form as plasmids, may be 
administered directly or as a component of a , 
recombinant virus or bacterium. In theory, any gene- 
therapeutical method for the immunotherapy of cancer 
based on DNA (DNA vaccines) may be used on B345-DNA, 
both in vivo and ex vivo. 

Examples of in vivo administration are the direct 
injection of "naked" DNA, either by intramuscular route 
or using a gene gun, which has been shown to lead to 
the formation of CTLs against tumour antigens. Examples 
of recombinant organisms are vaccinia virus, adenovirus 
or listeria monocytogenes (a summary was provided by 
Coulie, 1997) . Moreover, synthetic carriers for 
nucleic acids such as cationic lipids, microspheres, 
micropellets or liposomes may be used for in vivo 
administration of nucleic acid molecules coding for 
B345 peptide. Different adjuvants which enhance the 
immune response may also be administered, e.g. 
cytokines, either in the form of proteins or plasmids 
coding for them. The application may optionally be 
combined with physical methods, e.g. electroporation . 

An example of ex vivo administration is the 
transfection of dendritic cells as described by Tuting, 
1997, or other APCs which are used as cellular cancer 
vaccine . 

Thus, according to another aspect, the present 
invention relates to the use of cells which express 
B345, either per se or, in optionally modified form, 
after transfection with the corresponding coding 
sequence, in order to produce a cancer vaccine. 

Alternatively to the natural B345-cDNA or fragments 
thereof, modified derivatives may be used. These 
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include sequences with modifications which code for a 
protein (fragment) or peptides with greater 
immunogenicity; the same considerations apply to the 
modifications at the DNA level as to the peptides 
5 described above. Another type of modification is the 
lining up of numerous sequences^ coding for 
immunologically relevant peptides, in the manner of a 
string of beads (Toes et al., 1997). The sequences • may 
also be modified by the addition of auxiliary elements, 

10 e.g. functions which ensure more efficient release and 
processing of the immunogen (Wu et al., 1995). For 
example, by the addition of a locating sequence in the 
endoplasmatic reticulum (''ER targeting sequence'') the 
processing and hence the presentation and, lastly, the 

15 immunogenicity of the antigen can be increased. 

In another aspect the present invention relates to a 
recombinant DNA molecule which contains B34 5 DNA, e.g. 
connected to a regulatory DNA sequence, particularly a 
heterologous regulatory DNA sequence, e.g. a promoter 
20 or enhancer. 

» 

In another aspect the present invention relates to 
antibodies against B345 or fragments thereof. 
Polyclonal antibodies may be obtained in conventional 
manner by immunising animals, particularly rabbits, by 
25 injecting the B345-antigen or fragments thereof, and 
then purifying the immunoglobulin. 

Monoclonal anti-B345-antibodies may be obtained by 
standard procedures according to the principle 
described by Kohler and Milstein, 1975, by immunising 
30 animals, particularly mice, then immortalising 
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antibody-producing cells of the immunised animals, e.g. 
by fusion with myeloma cells, and screening the 
supernatant of the resulting hybridomas by standard 
immunological assays for monoclonal anti-B345- 
antibodies. For therapeutic or diagnostic use in humans 
these animal antibodies may optionally be chimerised 
(Neuberger et al., 1984; Boulianne et al . , 1984) or 
humanised (Riechmann et al., 1988; Graziano et al., 
1995) in the conventional manner. 

Human monoclonal anti-B345-antibodies (fragments) may 
also be obtained from so-called "Phage Display 
Libraries'' (Winter et al . , 1994; Griffiths et al.,» 
1994; Kruif et al., 1995; McGuiness et al . , 1996) and 
using transgenic animals (Bruggemann et al., 1996; 
Jakobovits et al., 1995). 

The anti-B345-antibodies according to the invention may 
be used in immunohistochemical analyses for diagnostic 
purposes, or as therapeutic agents in cancer therapy. 
(One example of the successful use of a monoclonal 
antibody in cancer therapy is herceptin; an antibody 
against the proto-oncogene HER2 . Herceptin can be used 
in breast cancer patients exhibiting an overexpression 
of HER2 . ) 

According to another aspect the invention relates to 
the use of B345-specif ic antibodies for selectively 
delivering any desired substances to or into a tumour 
which expresses B345. Examples of such substances are 
cytotoxic agents or radioactive nuclides, the effect of 
which is to damage the area surrounding the tumour. 
Because of the relatively tumour-specific expression of 



B345 only mild side effects are to be expected. In 
another aspect substances for visualising tumours which 
express B345 may be used with the aid of B345- 
antibodies. This is useful for the diagnosis and f6r 
evaluating the course of therapy. Therapeutic and 
diagnostic uses for antibodies which may be used as 
anti-B345 antibodies are described in WO 95/33771. 

The protein designated B345 according to the present 
invention and the protein fragments, peptides or 
peptide equivalents or peptidomimetics derived 
therefrom may be used in cancer therapy, e.g. to induce 
an immune response to tumour cells which express the 
corresponding antigen determinants. They are preferably 
used for the treatment of B345-positive tumours, 
particularly in lung and colon carcinoma. 

B345 or peptides, peptide equivalents and 
peptidomimetics can be used for the immunotherapy of 
cancer, as described e.g. in WO 00/73438, to the 
disclosure of which reference is hereby made. 

It is known that tumour-associated antigens may have 
tumour-specific mutations which contribute to an 
immunological differentiation between tumour and normal 
tissue (Mandruzzato et al., 1997; Hogan et al., 1998; 
Gaudi et al., 1999; Wolfel et al., 1994). In order to 
detect the presence of tumour-specific B345 mutations, 
appropriately using probes of the isolated cDNA 
according to the invention, the B345-cDNA from one or 
more different tumours is cloned and the sequences 
obtained are compared with normal tissue-B345-cDNA! 
Tests are carried out which are intended to show 
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whether tumour-B345 peptides from a section of sequence 
which is mutated compared with normal tissue-B345 have 
greater immunogenicity than normal tissue-B345 peptides 
from the corresponding section. To confirm that some 
5 mutations are tumour-specific, antibodies against these 
regions may be generated and tumour cells may be 
investigated for the expression of possible mutations. 

Thus, in another aspect, the present invention relates 
to B345-peptides, derived from regions of a tumour- 
10 expressed B345 which contain tumour-specific mutations. 

It can be assumed, from the preferred expression of 
B345 in tumour cells, that this protein has an 
important function for the tumour, e.g. for its 
formation, infiltration and growth and thus constitutes 
15 a target for chemotherapeutic intervention. 

With a view to its use as a target in targeted 
chemotherapy B345 is characterised in more detail so as 
to develop a suitable strategy for intervention with 
this function. 

t 

20 As the first step in the so-called "down-stream" 

functional analysis of B345, a bioinf ormatic analysis 
is conveniently carried out in a first step, to 
indicate the direction for the experimental validation 
of B345 as target. 

25 For this analysis the bioinf ormatic concepts based on 
similarity and modular structure form an essential 
basis. Established bioinf ormatic tools for detecting 
similarities are BLAST 

( http: //www. ncbi .nlm. nih. gov/BLAST , Altschul et al.. 
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1997) or FASTA (Pearson 6c Lipman, 1988), the 
specialised data banks such as Pfam 

( http: //www. Sanger. ac.uk/Pfam , Bateman et al . , 2000) 
and SMART ( http: //smart . embl-heidelberq , de , Schultz et 
5 al., 2000), which take account of domain structures. To 
refine the analysis applications such as Clustal 

( http : //www2 . ebi . ac . uk/clustalw , Higgins et al., 1996) 
HMMer ( http : //hmmer . wustl . edu ) , PSI-BLAST (Altschul et 
al., 1997) and the PROSITE data bank 
10 ( http: //www. expasy. ch/prosite , Hofmann et al., 1999) 

may be used. Statistical methods of analysis which are 
not based on homologies make it possible to predict 
other structurally and functionally relevant prope^rties 
such as the secondary structure and the occurrence of 
15 transmembrane segments and helix-turn-helix motifs. 

There are methods of predicting the secondary structure 
of proteins; Jpred 

( http: //barton. ebi . ac . uk/servers/ jpred . html , Cuff et 
al., 1998) is particularly worth mentioning. Predicting 
2 0 the secondary structure can underpin functional 

hypotheses, e.g. if the structure of the suspected 
homologue is known. 

According to bioinf ormatic analysis B345 has a helical 
transmembrane domain, both the N-terminal and the C- 

25 terminal region being hydrophilic, leading to the 
conclusion that this protein is a transmembrane 
protein. The N-terminal, extracellular region has a few 
CUB domains which have a tendency to form disulphide 
bridges and are therefore involved in dimerisation or 

30 protein-protein interactions (Bork et al., 1993). The 
C-terminal, intracellular end shows 
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homology with a receptor kinase and a C-kinase 
substrate . 

B345 is subsequently subjected to biochemical and 
biological analysis. • 

In a subsequent step the function of B345 for the 
progress of the tumour is clarified; e.g. by 
proliferation assays in vitro or in animal models which 
overexpress the B345 gene under investigation 
(constitutively or inducibly) and as a control either 
express it in deleted (inactive) form or down-regulate 
it using antisense (cf. e.g. Grosveld and Kollias, 
1992) . 

B345 may be used in screening assays to identify 
substances which modulate, particularly inhibit, the 
activity of this protein. In one embodiment an assay of 
this kind may consist, for example of introducing the 
B345 protein, or an active fragment thereof, into cells 
which react to the activity of B345 with proliferation 
or of expressing the corresponding B345 cDNA in the 
cell, and determining the proliferation of the cells in 
the presence and absence of a test substance. 

One example of test cells might be cells with a low 
division rate, e.g. primary cells containing no 
endogenous B345. To determine the suitability of the 
cells for a screening assay, the cells are transformed 
with B345-CDNA, cultured and tested using standard 
assays, e.g. the incorporation of thymidine, for their 
ability to proliferate. On the basis of a significant 
increase in their proliferation after B345 expression 
they may be used as test cells, e.g. in High Throughput 



Screening Proliferation assays. Examples of 
proliferation assays in the High Throughput format, 
e.g. based on the MTS assay, are described in 
WO 98/00713. 

Substances with a proliferation-inhibiting effect may 
be used to treat tumours with a significant B345 
expression, particularly in carcinoma of the lung and 
colon • 



20 



Summary of the Figures: 



Fig. lA: expression profile of B345, B452 and B540 in 
individual lung carcinomas and 
5 lung tumour cell lines. 

Fig. IB: expression profile of B345 in normal 

large intestine tissue and tumour cell lines. 

Fig. IC: graphic representation of the alignment of 
B345, B452 and B540 . 

10 Fig. 2A: Northern Blot analysis of the tumour cell line 
A549 using a 490bp long B345 PGR product 

Fig. 2B: Northern Blot analysis of various normal 

tissues using a 490bp long B345 PGR product 

Fig. 2C: Northern Blot analysis of various cancer 
15 tissues using a 318bp long B345 PGR product 

Fig. 3 mRNA expression analysis of B345 by 

real-time PGR of tumour and normal tissues. 

Fig. 4: mRNA expression analysis of B345 by 

real-time PGR of laser-microscope-prepared 
2 0 large bowel tumours (LGM) and normal 

large bowel tissue and tumour cell lines. 

Fig. 5: Graphic representation of the gene structure 
of B345. 

Fig. 6: Hydrophilicity and transmembrane blot of the 
25 B345 protein 
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Fig. 7: Potential protein structure of B345 



Description of the Tables 

Tab. 1: Summary of the Northern Blot data for 
5 B345 in various normal tissues (lA) f cancer 

cell lines (IB) ; and various normal tissues 
compared with the corresponding tumour tissue 
(IC) ' 

Tab. 2A: Summary of the data for the quantitative 
10 PGR of B345 in various normal and cancer 

tissues 

Tab. 2B: Summary of the data for the quantitative 

PGR of B34 5 in various normal tissues and 

» 

microdissected colon adenocarcinoma tissues 

15 

Explanation of symbols 

+++ extremely positive 

strongly positive 
+ positive 
20 {+) slightly positive 

- negative 



Example 1 



RDA ("Representational Difference Analysis") of the 
human adenocarcinoma cell line of the lung (A549) and 
normal lung tissue. 

The human lung adenocarcinoma cell line A549 (CCL 185) 
obtained from ATCC was cultured in T150 cell culture 
flasks. The nutrient medium used was MEM with 10% beat- 
inactivated, foetal calf serum and 2 mM of L-glutamine. 
Every 3 to 4 days the cells were split for propagation 
by trypsinisation 1:5 to 1:10. After about 
80% confluence had been achieved, 4 ml of a trypsin 
solution (contents per litre: 8g NaCl, 0.2g KCl, 1.13g 
anhydrous Na2HP04, 0.2g KH2PO4, 100 ml of 2.5% trypsin 
solution, Ig EDTA-Na-salt ; pH 7 . 2 - 7.4) were used per 
T150 cell culture flask to harvest the cells. The 4 ml 
were transferred into a 15 ml Falcon test tube, mixed 
with 8 ml of PBS, centrifuged at 1200 rpm in a Heraeus 
bench centrifuge (Megafuge 2. OR) for 5 min at 4°C, the 
cell pellet was mixed with 1 ml of lysing buffer (10 mM 
Tris-HCl pH8, 140 mM NaCl, 1.5 mM MgCla, 0.5% NP40) , 
shaken vigorously and centrifuged off in a 2 ml 
Eppendorf vessel at 12, 000 rpm and at 4°C for 5 min in a 
Sigma bench centrifuge (Sigma 202 MK) . The supernatant 
was transferred into a new Eppendorf vessel and after 
the addition of 55 pi of 20% SDS solution extracted 
twice with double the volume of a CHCla/phenol (1:1 v/v) 
mixture and extracted once with a single volume of 
CHCI3. The aqueous RNA-containing phase was mixed with 
1/10 volume of 3M NaAc (pH5) and twice the volume of 
9 6% EtOH and the RNA was precipitated overnight at - 
20°C. Starting from 1 mg of total RNA, the procedure for 
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isolating poly-A(+)RNA using the polyATtract Kit 
(Promega) was carried out according to the 
manufacturer's instructions. The A549 poly-A(+)RNA was 
stored in a concentration of 1 itig/ml in DEPC-treated H2O 
5 in aliquots at -80°C. 

In order to carry out representational difference 
analysis (RDA; Hubank and Schatz^ 1994; Diatchenko et 
al . 1996) the poly-A( + )RNA of the lung adenocarcinoma 
cell line A549 was used as the "tester" and that of 

10 normal lung tissue (1 mg/ml; Clontech^ Palo Alto; 

#6524-1) was used as the "driver". The RDA was carried 
out using the PCR-select™ kit (Clontech, Palo Alto) in 
accordance with the manufacturer's instructions, except 
that a modified primer /adapter-2-oligonucleotide system 

15 was used: adapter-2-alt-l (SEQ ID NO: 31) and nested- 
PCR-primer-2-alt (SEQ ID NO: 32) and adapter-2-alt-2 
(SEQ ID NO: 33) . The newly generated primer/adapter 
sequences make it possible to excise the particular 
cDNA fragments subsequently, thanks to the presence of 

20 three new restriction enzyme cutting sites (Kpn I, Sac 
I and Xho I) in the sequence of the nested-PCR-primer- 
2-alt after the subtracted cDNA fragments have been 
cloned into the pPCRII vector. It was necessary to 
design a primer /adapter sequence with a plurality of 

25 available restriction enzyme cutting sites because 

point mutations could often be observed, caused by* the 
PGR amplification steps, in the primer sequences in 
particular . 

After the synthesis of double-stranded cDNA using 
3 0 oligo-dT, the cDNA of ''tester'' and "'driver'' obtained 
was digested with Rsal (Rsal is a restriction enzyme 
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which recognises 4 bases and on a statistical average 
yields cDNA fragments which are 256 bp long) . Equal 
parts of "tester-cDNA" were ligated with either adapter 
1 or 2 and then separately hybridised with an excess of 
5 "driver-cDNA" at 65° C, Then the two mixtures were ' 
combined and subjected to a second hybridisation with 
fresh denatured "driver cDNA" . The concentrated 
"tester"-specif ic cDNAs were then exponentially 
amplified by PGR, with primers specific to the adapters 

10 1 and 2, To achieve further concentration^ one aliquot 
from this reaction was subjected to a second PGR with 
specific nested primers. The exponentially amplified 
cDNA fragments resulting from this reaction were 
ligated directly into the pCRII vector (Invitrogen; 

15 "TA-cloning vector") and then one third of the ligation 
mixture was transfected into competent E, coll 
(OneShot™, Invitrogen) . 

712 positive transf ormants (blue-white selection) were 
obtained and cultivated in 96-well blocks in LB-Amp 

20 Medium (1.3 ml per well) for 48 h at 37°C. 750 pi of the 
E. coll suspensions were used per well for the 
preparation of the plasmid-DNA (96-well QIAgen 
minipreparation method according to the manufacturer' s 
instructions) . The remaining bacterial cultures were 

25 stored as glycerol stock cultures at -SO^'C. 

A cDNA subtraction library consisting of 712 individual 
clones was obtained^r in the form of both E, coll 
glycerol stock cultures and also purified plasmids. 
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Example 2 

DNA-sequencing and annotation of TAA candidates 

The isolated plasmid-DNA of all the 712 clones (cf . 
Example 1) was sequenced by the Sanger method on an 
5 ABI-3 77 Prism apparatus. The sequences obtained were 
annotated using BioScout-Sof tware (LION, Heidelberg) 
and subjected to data bank comparisons (Genbank) . Of 
712 clones, 67 8 were able to be sequenced and 
annotated. The rest (34) either only had poly (A) 

10 sequences as an insert or corresponded to a religated 
vector or could not be sequenced. Of the 678 
annotatable sequences, 3 57 proved to be genes with a 
known function. The remaining 321 represented clones 
coding for genes with an unknown function; 5 9 of them 

15 did not even have entries in the human EST data bank. 

Known genes were not treated further. For those unknown 
genes for which an EST entry was available, the 
expression profile was evaluated: all those ESTs with 
>95% identity (BLAST) which belonged to the 

2 0 experimentally determined sequence of the subtraction 

libraries were examined. During annotation the material 
was subdivided into a) critical normal tissue, 

b) foetal, "disposable" and immunoprivileged tissue and 

c) tumours and tumour cell lines. On the basis of this 
2 5 "virtual mRNA profile" ("virtual Northern blot") 

2 00 clones for which no ESTs were found in group a) 
were selected for further experimental analyses 
(including the 59 clones for which there was no EST 
entry) . To narrow down the candidate clones still 
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further, from the sequences determined from the 2 00 
selected clones, pairs of oligonucleotide primers were 
designed and synthesised. First, 8 different cDNA 
libraries derived from human tissue (GibcoBRL 
5 "SUPERSCRIPT™"), which are direct ionally cloned in 
pCMV-SPORT, were tested by qualitative PGR for the 
presence of the particular candidates. The cDNA 
libraries used came from heart tissue (#10419-018) , 
liver (#10422-012), Leukocytes (#10421-022), kidney 

10 (#10420-016), lung (#10424-018), testis (#10426-013), 
brain (#10418-010) and foetal brain (#10662-013) . The 
PCR conditions were as follows: 20 fil of total volume 
per PCR mixture contained Ix TaqPol buf f er (50mM KCl , 10 
mM Tris-HCl pH 9, 0.1% Triton X-100) , 1.5 mM MgCl^; 0.2 

15 mM dNTPs ( Pr omega ) , 0.025 U/jUl Taq-DNA-polymerase 

(Promega) , 5 pM each of specific oligonucleotide primer 
for B345 (B345-D, SEQ ID NO: 34) and (B345-U, SEQ ID 
NO: 35) and 100 ng of the plasmid-DNA under 
investigation. Specific primers for GAPDH (SEQ ID NO: 

2 0 3 6 and 37) were used as a control. To check the 

selective detection, the relevant B345 specific primer 
pairs, oligonucleotide primer (SEQ ID NO: 34) and (SEQ 
ID NO: 35), were tested in parallel for the isolated 
plasmid with the original B345 fragment (fragment of 

25 B345 cDNA originally isolated) . The detectability of 
fragments of the expected length with a strong signal 
in one of the critical normal tissues (heart, liver, 
lung, kidney and leukocytes) , but not in the cDNA 
libraries of immunoprivileged tissues (brain, foetal 

30 brain and testis) under these PCR conditions (1 cycle: 
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3" 94°C; 35 cycles: 1" 94°C - 1" 55^C - 1" 72^C; 1 
cycle: 7" 72°C) was defined as the distinguishing 
criterion. Using this qualitative PGR analysis the 
number of candidates could be reduced to 56; clone B345 
5 was in this already preselected group of candidates. 

Example 3 

Expression analysis by cDNA chip hybridisation 

In order to design a cDNA chip a number of clones from 

10 categories having all kinds of functions ranging from 
apoptosis to cell cycle regulation were selected from 
the dBEST data bank by a nucleotide sequence search. In 
all, 1299 IMAGE clones were obtained (of which 1024 are 
known genes) and sequenced to check them. Microtitre 

15 plates with bacteria which contain approximately 80 0 bp 
long sequences from the 3" end of the gene in the , 
vector were sent to Incyte Pharmaceuticals, Inc. (USA), 
where they were spotted onto 60 chips. In addition to 
these clones, 12 0 EST clones identified by RDA were 

20 also spotted onto the chips. The DNA chips thus 

produced were then hybridised with Cy3 -labelled cDNA 
from normal tissue, tumour tissue and cell lines 
together with Cy5- labelled cDNA from a mixture of nine 
different normal tissues and the two signals were 

25 compared in order to standardise the expression values. 
The calculations were partly carried out in S-Plus or 
in Microsoft Excel . Evaluation of the chip experiments 
produced a very similar expression profile for B345, 
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B540 and B452 when hybridisation was carried out with 
lung cancer probes of cell lines and patient material 
(cf. Fig. lA) . A tumour-associated expression profile 
of this kind could also be found for B345 when colon 
adenocarcinoma was compared with normal colon tissue 
(cf . Fig. IB) . 

The sequence alignment of B345, B54 0 and B452 clearly 
showed an overlap between the individual EST fragments. 
It could therefore be assumed that the three clones are 
ESTs of one and the same gene. The resulting DNA 
section covers a length of 843 bp (cf . Fig. IC) and was 
used in further experiments to search public data 
banks. The search results shows no significant homology 
with known DNA or protein sequences, indicating that 
B345 is a hitherto unknown gene. 

Example 4 

Expression analysis of B345 using Northern Blots: 

B345 is a gene which is highly regulated in tumour 
tissues (cf. Fig. lA and IB, Tab. la and Tab. IB) 
according to DNA Chip analyses. 

In order to confer the transcription profile obtained, 
on the one hand, and determine the length of the 
expected mRNA for full size cloning, on the other hand, 
a Northern Blot analysis was carried out for B345 using 
human cell lines and the "Human Multiple Tissue 
Northern Blots" (Clontech and Invitrogen) . The probes 
used were 490 bp and 318bp long PGR products of B345 
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(primer (SEQ ID NO: 5 and SEQ ID NO: 6 or SEQ ID NO: 7 
and SEQ ID N0:8)) labelled with [a-^^P]dCTP (NEN, 
Boston). The hybridisation took place at 68 for 2'h; 
visualisation by standard autoradiography (Hyperfilm, 
5 Amersham) . Fig. 2A,2B and 2C and Tab. la and Tab. IB 
and IC show the results of this analysis: Fig. 2A the 
analysis of the cell line A549^ Fig. 2B the analysis of 
12 normal tissues (Peripheral Blood Lymphocytes (PBL) , 
lung, placenta, small intestine, liver, kidney, spleen, 

10 thymus, colon, skeletal muscle, heart and brain) and 
Fig. 2C the analysis of 8 cancer cell lines 
(promyelocytic leukaemia HL60, HeLa-S3, chronic 
myelogenic leukaemia K-5 62, lymphoblastic leukaemia 
MOLT-4, Burkitt^s lymphoma (Raji) , colon adenocarcinoma 

15 SW480, lung adenocarcinoma A549 and melanoma G361) . The 
B345 transcript is 6.5 kb long. , 

Example 5 

Analysis of the expression profile of B345 at the RNA 
20 level using quantitative RT-PCR (real time PGR or 
TagMan analysis) . 

In order to quantify more precisely the expression of 
mRNA in the various normal and tumour tissues, "real 
time PGR" was used, which makes it possible to 
25 calculate the RNA concentration compared with an 
ex t e rna 1 s t andar d . 

The RNA was isolated from frozen tissue with Trizol 
according to the instructions provided by the 
manufacturer, Gibco. To eliminate any contaminating DNA 
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the prepared RNA was digested with DNAase I as follows: 
3 jug of total RNA were incubated for 15 minutes at 37^C 

with 20 III of 5x AMV buffer (Promega) , 1 ptl of RNasin 
(Promega) and 2 [jlI of DNase I (Boehringer Mannheim) in 
5 a total volume of 8 0 ^1. 12 0 jjlI of phenol : chloroform: 
isoamyl alcohol (25:24:1) were added, mixed in a 
vortexer and briefly centrifuged. The aqueous phase was 
removed, 120 jil of chloroform: isoamyl alcohol (24:1) 
were added and the mixture was centrifuged as before. 
10 The purified RNA was precipitated with ethanol and 
5 dissolved in water. 

^ Then the total RNA was transcribed into cDNA with 

IJj reverse transcriptase (Superscript, Gibco, BRL) : 1 jul 

■J3 of oligo dT primer (Promega) was added to 3 fxg of total 

}^ 15 RNA and made up to a final volume of 10 ptl with water. 
^ After 5 minutes' incubation at 70°C the solution was 

M cooled for 5 minutes at room temperature. 5 fil of RT 

fiJ reaction buffer (5x, Gibco, BRL), 2.5 /zl of dNTPs 

(10 mM of each, Boehringer Mannheim) , 1 /zl of RNasin 
20 (10U///1, Promega), 1.5 jLzl of Superscript (10 U//^l, 
Gibco, BRL) and 5 (jlI of water were added and the 
mixture was incubated for 1 hour at 42 and the 
reaction was stopped by incubating for 3 minutes at 
95°C. 

2 5 In order to prepare a cDNA pool of a specific type of 
tissue or tumour, 3 to 10 different individual 
preparations from different patients were mixed 
together in equal proportions. 
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The quantitative measurement of the "household genes" 
S-actin, GAPDH and tubulin in cDNA pools was carried 
out as follows : 

A) E-Actin-TaqMan PGR (Perkin Elmer) 

5 For details of the principle of the TaqMan method of, 
manufacturer's information (Perkin Elmer). A TaqMan PGR 
run contained samples of li-actin control sequence with 
10^, 10^ 10% 10^ and 10^ copies/]al (Perkin Elmer) to 
determine the standard curve, a negative control 

10 without DNA and the cDNA pools which are to be 
quantified. All the samples were analysed in 
triplicate. For a 25 ]il reaction mixture, 1 \il of cDNA, 
2.5 |il of lOx buffer A (Perkin Elmer), 4 ]al of MgCl2 
(25 mM, (Perkin Elmer)), 0.5 pi of each nucleotide (10 

15 mM of dATP, dCTP, dGTP; 20 mM of dUTP) , 0.125 \il of 

TaqMan probe (20 pM; TaqMan probe for i5-actin (SEQ-ID 
NO: 20 fluorescence-labelled with 6-carboxyf luorescein 
at the 5* end and with 6-carboxytetramethylrhodamine at 
the 3' end), 1 pi of each 15-actin-specif ic primer (20 

20 pM each of Forward primer SEQ-ID NO: 21 and Reverse 
primer SEQ-ID NO:22), 0.25 pi of AmpErase uracil 
N-glycosylase ^'UNG'^ (1 U/pl, Perkin Elmer), and 
0.125 pi of AmpliTaq Gold (5 U/pl, Perkin Elmer) were 
mixed together, transferred into MicroAmp Optical Tubes 

25 (Perkin Elmer) and sealed with MicroAmp Optical Gaps. 
The PGR was carried out as follows: one cycle of 2 
minutes at 50 °G for the UNG reaction, one cycle of 10 
minutes at 95°G to activate the AmpliTaq, 40 cycles 
each of 15 seconds at 95° and 1 minute at 60 °C. Then 

30 the probes were kept at 25°C. The data was evaluated 
using the "Sequence Detection System 1.5bl" programme 
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(PE Applied Biosystems) , basically by comparing the 
fluorescence signals of the cDNA probes being 
quantified with the signals of the control plasmid 
dilutions of known concentration. 

5 B) GAPDH-TaqMan PGR 

For quantifying GAPDH, which was used like fi-actin or 
tubulin to standardise the RNAs used, the following 
primers or probes were used. The TaqMan probe used for 
GAPDH was a probe (SEQ-ID NO: 23) labelled at the 5' 
10 end with tetrachlorof luorescein and at the 3 ' end with 
carboxymethylrhodamine (Forward GAPDH primer: SEQ-ID 
NO: 24 and Reverse primer: SEQ-ID NO: 25) . The 
reactions were carried out as described above. 

C) Tubulin-SybrGreen PGR (Perkin Elmer) 

15 For the principle of SybrGreen PGR cf . the 

manufacturer's information (Perkin Elmer). A SybrGreen 
PGR run contained samples of tubulin control plasmid 
with 10^, 10^ 10\ 10^ and 10^ copies/jLil (Perkin Elmer) 
to determine the standard curve, a negative control 

20 without DNA and the cDNA pools which are to be 
quantified. All the samples were analysed in 
triplicate. For a 25 pi reaction mixture, 1 \xl of cDNA, 

2.5 lal of lOx SybrGreen buffer (Perkin Elmer), 3,5 jxl 
MgGl2 (25 mM, Perkin Elmer) ), 0.5 jtxl of each primer 
2 5 (2 0 /zM each, Perkin Elmer, Tubulin Forward (SEQ-ID 
NO:26) ; Tubulin reverse (SEQ-ID N0:27), 0.25 jul of 
AmpErase uracil N-glycosylase "UNG" (1 U//xl, Perkin 
Elmer), and 0.25 jil of AmpliTaq Gold (5 U//xl, Perkin 



Elmer) were mixed together, transferred into MicroAmp 
Optical Tubes (Perkin Elmer) and sealed with MicroAmp 
Optical Caps. The PCR was carried out as follows: one 
cycle of 2 minutes at 50 °C for the UNG reaction, one 
5 cycle of 10 minutes at 95°C to activate the AmpliTaq, 
40 cycles each of 15 seconds at 95° and 1 minute at 
60°C. Then the probes were kept at 25°C. The data was 
evaluated using the "Sequence Detection System 1.5bl" 
programme (PE Applied Biosystems) , basically by 
10 comparing the fluorescence signals of the cDNA probes 
being quantified with the signals of the control 
plasmid dilutions of known concentration. 

D) B345-TaqMan PCR 

The quantitative TaqMan-PCR analysis of B345 was 

15 carried out as described for the "household genes". 

However, B345 specific primers (SEQ ID MO: 28 and SEQ ID 
NO:29) (200 ng//xl) and a B345 specific probe (SEQ ID 
NO: 30, 2 0 /^M) labelled at the 5' end with 
tetrachlorof luorescein and at the 3 ' end with 

2 0 carboxymethylrhodamine were used. The PCR product of 
B34 5 with the primers SEQ ID NO: 2 8 and SEQ ID NO:* 29 
with a known copy number was used as the standard. 

Fig. 3 shows the TaqMan expression analysis (Fig. 3A; 
IS-actin; Fig. 3B: tubulin). It was found that B345 is 
2 5 expressed more highly in large bowel cancer tissue than 
in normal tissue (cf . Tab. 2a) . However, both the 
normal tissue and the tumour tissue constitute a very 
heterogeneous mixture of different cell types. 
Furthermore^ the proportion of tumour cells in the 



34 



tumour tissue varies considerably from about 30 to* 80%. 
In order to minimise this biological heterogeneity, the 
epithelial cells of the large intestine, which are the 
cells of origin of adenocarcinoma, and cancer cells or 
5 cancer regions were specifically prepared by laser 
microdissection. Tissue sections lOjum thick were 
prepared using a Leica, Jung CM18 0 0 cryomicrotome and 
placed on a polyethylene-coated slide (Bohm et al . , 
1997) . The sections, dried at ambient temperature for 

10 about 3 0 minutes, were incubated with Mayers 

haematoxylin (SIGMA DIAGNOSTICS) and then washed under 
running water for five minutes to remove any non- 
specifically bound dye. After 5 minutes' drying at 
37*^C, the laser microdissection was carried out. This 

15 was done using the PALM laser microscope (PALM GmbH, 
Bernried, Germany) and about 2000 to 5000 cells were 
prepared. The cDNA obtained by Reverse Transcription 
was analysed by Real Time PGR here too. The results 
show that the B345 expression in large intestine 

20 carcinoma cell lines and in patient material is 

comparatively many times higher than that of the normal 
tissue of the large intestine. For standardisation, the 
expression level of GAPDH was determined (cf . Fig, 4 
and Tab. 2B) . 

25 

Example 6 



a) Cloning of the cDNA of B345 
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Searching through data banks for sequences of gene 
fragments (ESTs^ expressed sequence tags) which can be 
used for the iii silico" cloning of B345 yielded an 
overlapping EST contig of about 1500 bp. The polyA 
5 region at one of the ends indicated the orientation of 
the DNA section in relation to 5 ^ - 3 ^ orientation; 
which is essential when designing new primers for the 
amplification of B345-specif ic cDNA fragments. 

First, the potential 3 end described by the data bank 
10 analysis was verified by experimental approaches. RNA 
from the lung carcinoma cell line Calu 6 (AACC No. 
HTB56) was reverse transcribed using the primer (SEQ ID 
NO: 9) and the resulting single-strand cDNA was 
amplified by PGR with the gene-specific primer SEQ ID 
15 NO: 5 and the adaptor primer SEQ ID NO: 10. 

For a 25 />tl PGR mixture, 1 /xl of the cDNA pool was 
mixed in water with 2.5 /xl of lOxTaq buffer (Promega) , 
1.5 ^1 of MgGl^ (25 mM, P r omega ) , 0.5 j^l of dNTPs (10 mM 
each, Boehringer Mannheim) , 1 /xl of primer mixture (2 0 
20 /UM each), 0.15 jil of Taq polymerase (Promega) . The PGR 

was carried out as follows: Ix 94*^0 for 3 minutes; 3 Ox 
94''G for 30 seconds, 55''C for 30 seconds, 72°C for 1 
minute; kept at 4°G. The PGR was analysed on a 1.2% 
agarose gel , 

25 The two primers were then used to sequence the purified 
PGR product. The sequences found showed high homology 
with the DNA fragment cloned in silico" (including the 
polyA tract) . 
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Since the cloning of 5* terminal sequences is usually a 
very laborious processor various methods were used below 
to solve the problem. 

Here again^ Calu 6 was used as the starting cell line. 
5 After reverse transcription of the RNA with the primer 
SEQ ID NO: 9 and synthesis of the second strand, a 
linker consisting of the two oligos SEQ ID NO: 11 and 
SEQ ID NO: 12 was ligated onto the double stranded cDNA 
(Abe et al., 1992). The resulting LoneLinker cDNA 

10 library was then amplified with the gene-specific » 
primer SEQ ID NO: 6 in linear manner over 35 cycles. 
One aliquot of the B34 5-enriched cDNA could then be 
further amplified with the primers SEQ ID NO: 13 and 
LLEcoRIA SEQ ID NO: 11. After gel electrophoresis of 

15 one aliquot and Southern analysis with the gene- 
specific oligo SEQ ID NO: 14, it was possible to locate 
a 5 kb band. This fragment was then sequenced step by 
step and aligned with the sequence of the EST contig. 

In order to check the resulting sequence from the 
20 LLcDNA cloning, two fragments were amplified by PGR 
(SEQ ID NO: 15 and SEQ ID NO: 16 or SEQ ID NO: 6 and 
SEQ ID NO: 17) and used to screen Lambda gtlO cDNA 
phage libraries. Positive plaques were isolated and 
amplified by PGR using gtlO-specif ic primers (SEQ ID 
25 NO: 18 and SEQ ID NO : 19) . Subsequent sequencing and 

alignment with the sequences led to the assumption that 
this is a differentially spliced product. The splice 
donor, acceptor and lariat sequence were found 
subsequently. Using PGR by a suitable combination of 
3 0 primers, searches were carried out in various cell 
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lines for differential splice products; in all the cell 
lines screened, only one product was found and led^ to 
the gene structure shown in Fig. 5. The cDNA shown has 
an open reading frame (ORF) which codes for a potential 
5 protein 749 amino acids long. The translation 
initiation site at position 215 corresponds to 
approximately 75% of a Kozak consensus sequence. The 
results obtained in this experiment led to the 
transcription initiation site being determined exactly 
10 by primer extension in another experiment (Example 6b) , 
in order to be certain that the 5' end found is 
actually the 5' end of B345 . The amino acid sequence of 
B345 derived from the cDMA (SEQ ID N0:1) obtained in 
this cloning test is shown in SEQ ID NO: 2. 

15 b) Second cloning test; determining the 5' and 
promoter region of B345 

Using the Promotor Finder DNA Walking Kit (Clontech) 
and subsequent primer extension reaction, the 5' region 
and the promoter region and also the exact 

20 transcription initiation site were determined. The 5' 
region was amplified using a genomic DNA library 
produced by Clontech with B345 specific primers (SEQ ID 
NO: 38 or nested SEQ ID NO: 39) and Adaptor Primer in the 
kit. To determine the exact start of transcription, the 

25 Primer Extension reaction was carried out. To do this, 
the primer SEQ ID NO: 40 was labelled with 10 U of T4 
Polynukleotide Kinase (Promega) and Sjal of [Y-"^^P]A*rp 
(3000Ci/mmol) at the 5' end according to standard 
protocols (Sambrook et al . , 1989). The labelled 

30 oligonucleotide was purified by precipitation. For the 
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primer extension reaction, 10,000 cpm of 
oligonucleotide were used in a total volume of 20 ]il to 
25 pg of Total RNA of the Colo 205 cell line (ATCC:CCL- 
222} . 

5 The RNA of the cell line was reverse transcribed with 
the radioactively labelled primer and applied to a 
10% polyacryamide gel. To determine the exact band 
length a PGR fragment of nt 1000 - nt 1362 was 
sequenced with "^^S labelled nucleotides and also 

10 applied. The fragment of 209 nucleotides resulting from 
the elongation of the reverse primer fixes the start of 
transcription precisely at position 201. In this way 
the B345 sequence obtained in Example 6a was extended 
in the 5* region and a new start codon was located at 

15 position 283. By repeated sequencing even in the 3' 

region, an additional base, compared with the sequence 
shown in SEQ ID N0:1, was found at position 2430, 
leading to a shift in the reading frame and thus 
displaces the stop codon from position 2729 to 2791. 

20 The cDNA (SEQ ID NO: 3) obtained in this experiment ' has 
an open reading frame which codes for a potential 
protein 836 amino acids long (SEQ ID N0:4). 



Example 7 

25 Bioinf ormatic analysis of the function of B345 

The resulting primary amino acid sequence of B345 is 
shown in SEQ ID NO: 4. Analysis of the hydrophilicity 
plot of the amino acid sequence using the method of 
Kyte and Doolittle (1982) shows that the B345 protein 



39 



has two characteristic hydrophobic domains (amino acids 
pos. 1-29 and 666 - 691), which represent a signal 
peptide and a helical transmembrane domain (Fig. 6) . 
This polarised structure indicates that B345 is an 
5 integral membrane protein. The transmembrane helix 

connects an extracellular section about 666 amino acids 
long and a short intracellular section (145 amino 
acids) (cf. Fig. 7). 

The extracellular domain also shows clear signs of the 

10 existence of a CUB domain at positions 220 - 350 and 

signs of 2 possible other CUB domains in the region of 
amino acids 425 - 660. CUB domains occur at various 
proteins^ generally regulated during the development of 
the embryo. In addition;. CUB domains can sometimes also 

15 be found at EGF (Epidermal Growth Factor) -like domains. 
A recent publication (Gerstein et al . ^ 2000) 
demonstrates that proteins containing CUB domains are 
the most markedly differentially regulated proteins in 
C. elegans. Since genes which play a key role in embryo 

20 development also have analogous functions in cancer, it 
can be assumed that the overexpression of B345 in cells 
brings about a change in the properties of the 
substrate adhesion or the extracellular matrix. The 
protein also has 12 potential N-glycosylation sites 

25 which can be found in the predicted extracellular 

domain, which concurs with the predicted orientation of 
the protein. 

With a BLAST hit (E-value: 5.8 x 10"^) for the region of 
amino acids 235 to 282 of B345 it was possible to 
30 identify a complement-activating component of the RA~ 
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reactive factor (RARF) from mus inusculus. The alignment 
is located within the CUB domain 1 of B345. 

The CUB domains 2 and 3 (section 425-535 and 545-660) 
exhibit marginal homology with the human and fugu 
5 procollagen C proteinase enhancer protein (PCOLCE) . 
These regions occur in the part of PCOLCE which 
contains a CUB domain tandem repeat (E-values: 0.5 
(human) and 2.7 (fugu)). CUB domains sometimes occur in 
repeats . 

10 Presumably, the B345 protein forms a B-sheet secondary 
structure, as it is known that CUB domains fold into a 
15 sandwich. 

The intracellular domain (section 691-836) has no 
significant homologies. However, the total C-terminus 
15 aligned with an EST (AW063026) of human ovarian cancer 
cells (82% identity over 124 amino acids) , 

Example 8 

Determining the precise genetic structure of B345 

20 First of all, Bac clones were sought in public data 

banks (BLAST search) which contain the B345 gene. The 
Bac clones Ac068625 and Ac010170 contained a ma3or part 
of the gene. With intron-spanning primers, splice 
acceptor and donor sequences were sought in Colo 205 

2 5 cDNA and genomic DNA was sought as a template. The PCR 
procedure was carried out as follows: Ix 95°C for 2 
minutes, 35x 95'*C for 15 seconds, 68 °C for 3 minutes 
and then kept at 4''C. The PCR was analysed on a 1.2% 



agarose gel and the lengths of the PGR products of the 
2 templates were compared with identical primer 
combinations. It was found that B345 consists of 8 
exons, separated by 7 introns (Fig. 5) . 

The chromosomal location of the gene was determined by 
f luorescence-in situ-hybridisation (FISH) . The human, 
digoxigenin-labelled B345 probe together with the 
biotin-labelled probe of B47a2 (Knight et al., 1997), 
which is found at the sub-telomeric region of the 3p 
arm of the chromosome, was hybridised with metaphase 
chromosomes of two "normal" individuals (Lichter et 
al., 1988). The hybridised digoxygenin probe was 
detected by anti-sheep-Dig (Boehringer Mannheim FRG) 
and rabbit anti-sheep FITC-labelled antibodies. The 
biotin-labelled probe, on the other hand, was made 
visible with mouse anti-biotin and rabbit-anti-mouse 
(TRITC) and by subsequently staining with DAPI . The 
FISH results show that a majority of the metaphases 
have clear signals on one or both chromatides of 
chromosome 3 in the region p21-p23. The co-location of 
the B47a2 (TRITC) probe on the same chromosomal arrfi 
served to confirm the position. 



Tab. lA 
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Tissue 



expressjon 



PBL 



lung 



++ 



O 
m 
m 
m 

m 
m 
m 

o 
nj 
liil 
in 
a 
111 



placenta 



small intestine 



liver 



jkidney 



spleen 



thymus 



colon 



++ 



skeletal muscle 



heart 



brain 
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Tab. IB 







prornyeiocyiic leuKaernia nLou 












lymphoblastic leukaemia MOLT-4 




Burkitt's lymphoma (Raji) 




colon adenocarcinoma SW480 


+++ 


lung adenocarcinoma A549 


+ 


melanoma G361 
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Tab. IC 



O 

m 

m 
m 

o 

HI 

ry 
m 
m 
m 





Expression 


Oesophagus Tumor 


(+) 


Normal Oesophagus 


(+) 


Stomach Tumor 




Normal Stomach 


+ 


Colon Tumor 


+++ 


Normal Colon 


++ 


Rectum Tumor 


+ 


Normal Rectum 


(+) 
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Tab. 2 A 



Tissue 



Expression B345 / Expression B345 / 



o 

m 

o 
ru 
ru 
m 

a 



Lung Adenocarcinoma 



Lung Adenocarcinoma 



Normal Lung 



bis (+) 



Colon Adenocarcinoma 



++ 



Colon Adenocarcinoma 



+++ 



Normal Colon 



bis (+) 



Breast IDC 



Breast 



Hodgkin's Lymphoma 



(+) 



I++ 



+++ 



Spleen 



Testis 
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Tab. 2B 



Cell lines and Tissues 


Expresstoh B345 / GAPPH . 


Colon Adenocarcinoma SW480 


+ 


Normal Colon (Clontech) 


(+) 


Normal Colon (Invitrogen) 


(+) 


Lung Adenocarcinoma A549 


(+) 


Colon Adenocarcinoma Colo 205 


+++ 


PG 102142 Tumor (Colon Ac.) 


+++ 


PG 21900 Tumor (Colon Ac.) 


++ 


PG 7066 Tumor (Colon Ac.) 


+++ 


PG 32389 Tumor (Colon Ac.) 


++ 
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